Maize acetyl CoA carboxylase encoding DNA clones

ABSTRACT

PCT No. PCT/GB92/02205 Sec. 371 Date Aug. 18, 1994 Sec. 102(e) Date Aug. 18, 1994 PCT Filed Nov. 27, 1992 PCT Pub. No. WO93/11243 PCT Pub. Date Jun. 10, 1993Clones are disclosed comprising at least part of a DNA sequence of a gene encoding maize acetyl CoA carboxylase, or a sequence showing substantial homology therewith, flanked by heterologous DNA. DNA from such clones may be used to isolate similar sequences in other plants. Such sequences may be used to transform plants to give a variety of useful effects, including resistance to herbicides such as fluazifop, and modified oil-bearing properties.

FIELD OF THE INVENTION

The present invention relates to novel DNA clones and uses thereof: including the use thereof to transform plants and genetically modified plants so obtained.

SUMMARY OF THE INVENTION

Clones according to the present invention comprise at least part of a DNA sequence of a gene encoding maize acetyl CoA carboxylase, or a sequence showing substantial homology therewith, flanked by heterologous DNA.

Acetyl CoA carboxylase (ACCase), a biotin-containing enzyme, catalyses the first step in the fatty acid biosynthesis pathway, namely the carboxylation of acetyl CoA to give malonyl CoA. Accordingly clones according to the invention have a number of uses, depending on their exact nature.

For example, clones according to the invention containing partial coding sequences of an ACCase gene may be used to probe plant DNA (for example genomic DNA, or cDNA libraries prepared from messenger RNA) to obtain other clones according to the invention. These other clones may be different or longer; they may be substantially complete clones of the maize ACCase, or part or all of genes coding for corresponding or similar enzymes in maize or other plants (both monocots and dicots). Partial sequences according to the invention may be used (generally but not necessarily in reverse orientation) in combination with a regulator sequence functional in plants to construct an expression cassette. This expression cassette can then be used to transform plants to down-regulate the production of ACCase enzyme. This can alter the composition of seed or other plant parts, for example enabling oil-bearing plants (oilseed rape, sunflower, oilpalm) having a lower or otherwise modified oil content to be produced.

Clones containing longer or substantially complete sequences of an ACCase gene may also be used to form similar expression cassettes. Such coding sequences may be used in plants (either by using more powerful promoters or by inserting extra copies of the gene) to promote the expression or overexpression of ACCase, for example leading to plants with an enhanced oil content.

Partial clones according to the invention can be used to probe plant DNA to recover the promoter of the ACCase gene. This promoter may then be used to generate RNA in a tissue-specific or developmentally-regulated fashion. The RNA so generated may inhibit expression of ACCase, or another gene; or be mRNA that generates ACCase, or another protein.

In monocotyledonous plants the ACCase enzyme is inhibited by certain classes of grass-weed herbicides (aryloxyphenoxypropionates such as fluazifop, alkylketones) whereas in dicots the enzyme is relatively tolerant to these herbicides. Accordingly in a further aspect the present invention comprises monocotyledonous plants resistant to herbicides active by interfering with the fatty acid synthesis pathway, said plants having been obtained by transformation with constructs according to the invention adapted to express ACCase enzyme. Such plants may be made resistant in various ways, for example:

1. By overexpression of monocotyledonous ACCase for example using powerful promoters or multiple gene insertions;

2. By expression of dicotyledonous ACCase;

3. Possibly, by expression of a resistant form of maize ACCase. The gene for resistant maize ACCase might be obtained by mutagenesis and selection in a suitable host or by tissue culture of the plant in the presence of the appropriate herbicide.

The purpose in providing crop plants which resist the action of a herbicide is to facilitate the destruction of weeds growing between the plants by the overall application of a herbicidally effective concentration of a herbicide which would destroy the crop plant in its normal, that is herbicide-sensitive, state. Such resistant plants are also useful where there has been any short term carry-over of herbicide from a previous crop. The development of crops with novel types of herbicide resistance is agronomically useful, giving the farmer additional options to obtain an improved harvest by means which may be safer, cheaper or more effective than those currently available. In the present case, rendering maize or small-grain cereals such as wheat and barley resistant to the herbicide fluazifop would enable this herbicide to be used to combat wild oats in these crops--a very effective advance over what is available at present.

We have prepared clones according to the invention by extracting ACCase from plant material, purifying it, preparing antibodies thereto by challenging a mammalian immune system therewith, immuno-purifying the antibodies and using them to select clones from a plant DNA library. Clones A3 and A4 prepared by this process (designated A3 ACC cDNA and A4 ACC cDNA, respectively) were deposited on 13 Nov. 1991 with the Australian Government Analytical Laboratory, 1 Suakin Street, Pymble NSW2073.

Accordingly, a simpler process for the preparation of clones according to the invention comprises reproducing the clones designated A3 ACC cDNA and A4 ACC cDNA available from the Australian Government Analytical Laboratory, or relatives or descendants of such clones.

BRIEF DESCRIPTION OF THE DRAWINGS

The invention will now be further described with reference to the following Examples, and to the drawings, in which:

FIGS. 1a-1b are an SDS-PAGE diagram of partially purified maize ACCase, in which Lane A is a Coomassie Blue stained gel, and lane B is a streptavidin-phosphatase treated electroblot of ACCase showing the presence of biotin-containing polypeptides;

FIG. 2 compares the digestion patterns of an A4-like cDNA (A12) and an A3-like cDNA (A34) cut with a number of different restriction enzymes; the odd numbered lanes show ACC12 and the even numbered lanes show ACC34. The enzymes used are Pst1 (1,2), BamH1+Bg1II (3,4), HindIII (5,6), HincII (7,8), EcoR1 (9,10), AccI (11,12), PvuII (13,14), PvuII+HincII (15,16), PvuII+EcoRV (17,18). Marker lanes are λ DNA digested with HindIII.

FIGS. 3(a) to 3(j), taken together, are the nucleotide and deduced amino acid sequence of the A3 ACC cDNA maize clone (SEQ ID NO:1 and SEQ ID NO:2);

FIGS. 4a-4b are the nucleotide sequence and deduced amino acid sequence of the 5' ends of the A3-like clones A10 (a) and A34 (b) (SEQ ID NO:3 through SEQ ID NO:6);

FIGS. 5a-5c are the nucleotide sequence and deduced amino acid sequence of the 5' ends of the A4-like clones A12 (a), A4 (b) and the 3' end of A12 (c) (SEQ ID NO:7 through SEQ ID NO:11).

DESCRIPTION OF PREFERRED EMBODIMENTS

In the Examples we describe the use of affinity purified antibodies to maize ACCase to select clones from a maize cDNA expression library; and we show that three of the clones obtained encode maize ACCase. Confirmatory evidence for the identity of the clones is as follows:

(i) amino acid sequences, deduced from DNA sequencing of one of the clones, were found to be up to 50% identical with parts of the rat and chicken ACCase sequence.

(ii) the longest cloned cDNA insert (4.5 kb) is considerably longer than the predicted length of any mRNA encoding known biotin-containing enzymes in plants, other than that encoding ACCase.

We believe that the A3 and A4 clones represent two different maize ACCase genes. We have also found that the maize ACCase clones hybridise to Southern blots of Arabidopsis DNA. This enables us to obtain from such Arabidopsis DNA probes suitable for isolation of ACC-type coding sequences and genes from other plants, both dicots and monocots.

Material and Methods

Partial Purification of Acetyl CoA Carboxylase From Maize Leaves

Plant Material. Seedlings of maize (Zea mays Dekalb XL81) were grown in sterile soil, in a glasshouse with the temperature maintained between 20° C. and 30° C. under natural illumination. Leaves of 2 to 4 week old plants were used for enzyme preparation.

Purification. Extraction and purification procedures were carried out at 0°-4° C.

Extraction. Leaf material (160 g) was homogenised in a Waring blender with 300 ml of medium consisting of 0.1M Tris-HCl buffer (pH 8.0), 10 mM MgCl₂, 1 mM EDTA, 20 mM 2-mercaptoethanol, 0.2 mM PMSF, 2 mM benzamidine hydrochloride, and 2% (w/v) of polyvinylpolypyrrolidone. The homegenate was filtered through two layers of muslin and two layers of Miracloth and the filtrate (380 ml) centrifuged at 13000 g for 30 minutes.

Ammonium sulphate and polyethylene glycol fractionation steps. To the centrifuged extract (365 ml) was added sufficient solid ammonium sulphate with stirring to bring the mixture to 40% saturation. After 40 minutes gentle stirring the mixture was centrifuged at 13000 g for 30 minutes and the supernatant discarded. The protein pellet was re-dissolved in a medium consisting of 20 mM Tris-HCl (pH 8.0) 10 mM MgCl₂, 1 mM EDTA, 10 mM dithiothreitol, 0.2 mM PMSF, and 10% (v/v) glycerol. To this solution (204 ml) was added solid PEG (8 g per 100 ml) with stirring. After 40 minutes gentle stirring the suspension was centrifuged as above and the pellet discarded. More PEG was added to the supernatant (12 g per 100 ml) and, after stirring for 40 minutes, the mixture was again centrifuged and the pellet kept.

Affinity chromatography on Matrix gel Orange A. The precipitated protein was re-dissolved in buffer consisting of 20 mM Hepes KOH (pH 6.8), 10 mM MgCl₂, 1 mM EDTA, 10 mM DTT, and 10% (v/v) glycerol and clarified at 10000 g for 20 minutes. The supernatant (100 ml) was passed through a 40 ml column of orange A gel (previously equilibrated with column buffer consisting of the above dissolving medium but containing 20 mM 2-mercaptoethanol instead of DTT) at a flow rate of approximately 0.4 ml per minute. The column was then washed with 800 ml of column buffer at about 1 ml per minute to remove unbound protein. Acetyl CoA carboxylase was subsequently eluted from the column with 100 ml of 0.5 mM CoA in dissolving buffer followed by more buffer. Fractions containing acetyl CoA carboxylase activity were pooled (100 ml) and concentrated by ultra-filtration over a PM30 membrane (Amicon Scientific) to a volume of 5.8 ml. The enzyme solution was divided into aliquots and rapidly frozen (in liquid nitrogen) before storage at -80° C.

Measurement of Enzyme Activity

Acetyl CoA carboxylase activity was measured at 30° C. by the enzyme and substrate dependent incorporation of radioactivity from NaH¹⁴ CO₃ into acid-stable products, based on published methods. Reaction mixtures contained (in a total volume of 200 μl): 100 mM Tris-HCl (pH 8.0), 50 mm KCl, 5 mm MgCl₂, 5 mM DTT, 2 mM ATP, 15 mM NaH¹⁴ CO₃ approx. 0.25 Ci per mole), 0.75 mM acetyl CoA (trilithium salt), and enzyme (2-10 mU). Reactions were initiated with enzyme and stopped after 6 minutes by adding 50 μl of 6M HCl. Portions of the reaction mixture were spotted into filter paper discs, which were then dried and the acid-stable ¹⁴ C reaction products measured by scintillation counting. The amount of enzyme which catalyses conversion of 1 μmol of substrate per min is defined as 1 unit.

Protein Determination

Protein concentration was measured by Coomassie Blue dye binding according to published methods and using bovine serum albumin as standard.

SDS-Polyacrylamide Gel Electrophoresis/electroblotting

SDS-PAGE and electroblotting to nitro-cellulose or PVDF membranes were carried out according to published methods.

Electrophoretic Isolation of ACCase Subunit

3.7 ml of a preparation of ACCase (partially purified essentially by the above procedure) containing about 680 μg of protein were concentrated to 100 μl using centrifugal filters (Amicon Centricon 100). The concentrated protein solution was added to 80 μl of a digestion mixture containing 125 mM Tris-HCl (pH 6.7), 2% (w/v) SDS, 10% (w/v) glycerol and 0.01% (w/v) bromophenol blue, and the solution heated at 98° C. for 5 minutes. The digest was applied to several tracks of a 7.5% polyacrylamide gel and the protein subunits separated by SDS-PAGE. The gel was subsequently washed with 5 mM Tris-HCl (pH 6.7) then stained with Coomassie Blue G (1% w/v in water) before destaining with water. Segments of the stained gel containing ACCase subunit were cut out.

Chemical Cleavage of ACCase Subunit and Isolation of Fragments

Gel segments containing ACCase subunit were gently agitated in 4 ml of UHA (urea/water/acetic acid; 25 g:25 ml:25 ml) for 80 min with one change of solution. The segments were then treated with 4 ml of a solution of 0.2% (w/v) N-chlorosuccinimide in UHA for 60 minutes to cleave the ACCase subunit at tryptophan residues, followed by washing with 5 mM Tris HCl, pH 6.7 (30 min). The gel segments were then equilibrated with a solution containing 125 mM Tris-HCl (pH 6.7), 5% (w/v) PEG, 20 mM DTT, 1% (w/v) SDS, for 90 min, finally heating the segments in solution to 95° C. for 6 min. The gel segments, containing cleaved fragments of ACCase subunit, were then loaded onto several tracks of a 16% acrylamide gel and the fragments separated by SDS-PAGE. After separation, the polypeptide fragments were electroblotted onto a PVDF membrane by published methods and the membrane stained with Amido black. Portions of the membrane containing major polypeptide fragments were excised and destained exhaustively with 10 mM Tris base, then washed with water before being submitted for N-terminal amino acid sequencing.

N-terminal Amino Acid Sequencing

N-terminal amino acid sequencing was carried out by established techniques at the Biomolecular Resources Facility of the Australian National University.

Production of Antibodies to Maize ACCase

Orange A-purified maize ACCase (˜50 μg protein in 1 ml Freund's complete adjuvant) was injected into the hind leg muscle of a rabbit. Four weeks later a further 50 μg of the maize ACCase in incomplete Freund's adjuvant was similarly injected. The rabbit was bled at two to three week intervals and serum collected. The first serum sample did not contain detectable antibodies to maize ACCase as judged by binding to maize ACCase after SDS-PAGE and immunoblotting or inhibition of maize ACCase activity. The second and subsequent bleedings yielded antiserum that bound to maize ACCase as judged by the two criteria listed above. This antiserum was used as the starting material for purification of affinity-purified antibody.

Affinity-purification of Antibodies to Maize ACCase

Partially purified maize ACCase (500 μl) was resolved from contaminants by preparative SDS polyacrylamide gel electrophoresis. The membranes were stained with Amido Black and the ACCase band at ˜200 kDa was cut out. Residual adsorption sites on the membrane were blocked by incubation of the membrane in TBS (20 mM Tris-KCl, pH 7.5, 150 mM NaCl) containing 5% (w/v) powdered milk. Antibodies to maize ACCase were adsorbed to the membrane by incubating the ACCase-coated membrane in 1 ml dilute antiserum (0.5 ml antiserum 0.5 ml TBS) at room temperature for 1 hour. The membrane was removed, washed for 5 minutes in 10 ml TBS+0.05% Tween 20 then twice (5 min each) in 10 ml TBS. The membrane-bound antibodies were eluted by incubation with 1 ml 0.1M glycine/HCl pH2.6 for 3 minutes and neutralised. This cycle of adsorption and elution was repeated two more times using the same membrane and antiserum.

Before use the affinity-purified antibody was diluted to 20 ml with TBS. This antibody preparation was further depleted of non-specific antibodies by incubation with nitro-cellulose coated with an E. coli λ phage lysate. Finally, the affinity-purified antibody was made 10% with respect to horse serum albumin to further minimise non-specific binding.

Immunoprecipitation of ACCase by Rabbit Antiserum

A series of dilutions of serum from control and ACCase-immunised rabbits were prepared. Enzyme extract (50 μl) supplemented with 0.02% (v/v) Triton X-100 was mixed with diluted serum (50 μl) and incubated at 30° C. For maize, the enzyme was Orange-A purified while for Amaranthus edulis a G-25 treated crude leaf extract prepared from 1 g of tissue in 2 ml of solution containing 0.1M Tris-HCl (pH 8.0), 10 mM MgCl₂, 1 mM EDTA, 10 mM DTT, 0.01% (v/v) Triton X-100 was used. After 50 minutes incubation of serum with maize enzyme, immune complexes were removed by adding 20 μl of a suspension of Protein A-Sepharose (0.2 g per ml in 0.1M KH₂ PO₄ buffer pH 7.0) and centrifuging (11600 g, 5 min) after a further 60 min at room temperature. Enzyme activity remaining in the supernatant was measured. Incubation of Amaranthus edulis extract with serum was for 160 min before removal of immunoprecipitates with Protein A and centrifugation (5 min, 11500 g). The A. endulis mixtures were left a further 16 hours at 4° C. and centrifuged again before assay of remaining enzyme in the supernatant.

Maize Leaf cDNA Library

This was a gift from Dr A Barkan, Department of Botany, University of California, Berkeley. Messenger RNA was isolated from leaves of two week old Zea mays (B73) seedlings. cDNA was synthesised using a ZAP cDNA synthesis kit (Stratagene). The cDNA was ligated into the EcoRI site (5' end of the cDNA) and the XhoI site (3' end of the cDNA) of lambda expression vector λZAP. The library contained 8×10⁵ recombinants.

Screening of Maize cDNA Library

The maize λZAP cDNA library was screened for clones producing fusion proteins that would bind to antibodies purified by adsorption to purified maize ACCase.

Phage was adsorbed to E. coli Y1090 cells mixed with 0.8% agarose/10 mm MgSO₄ /0.02% maltose in LB, plated at a density of about 100 plaques cm². The plates were incubated at 37° C. for about 4 hours; then nitro-cellulose filters impregnated with IPTG were placed on the agarose surface and the incubation continued at 37° C. overnight. The filters were removed, washed in TBS buffer containing 5% (w/v) powdered milk to block the surface of the nitro-cellulose before screening with affinity purified antibody. The filters (6×137 mm diam) were incubated in 20 ml of the affinity purified antibody for 2 hours at room temperature. The filters were washed in TBS containing 0.05% Tween 20 three times, then in goat anti-rabbit immunoglobulin coupled to alkaline phosphatase (1:7500 in TTBS, 0.2% Tween 20 in TBS) for 1 hour at room temperature. The filters were freed of unbound 2nd antibody by washing twice in TTBS then once in TBS. The plaques expressing ACCase-fusion proteins were visualised by incubating on an alkaline phosphatase reaction mixture containing BCPIP and NBT (150 μl/ml BCPIP, 150 μl/ml NBT in 100 mM Tris-Hcl, pH 9.5, 100 mM NaCl, 1 mm MgCl₂). Plaques giving a positive reaction were further purified by repeated rounds of screening using the same antibody solution until all plaques on a plate gave a positive signal. Three positive plaques were obtained by this procedure.

Results

Following the procedure detailed above, ACCase was purified over 100-fold from maize leaves to a specific activity of 3-4 units per mg protein (Table 1). Analysis by SDS-PAGE of the enzyme at different stages of purification showed that ACCase consisted of polypeptide subunits of about 200-220 kDa and, at the final stage, was about 70% pure (FIG. 1). The presence of biotin in the ACCase subunit was demonstrated by Western blotting of the SDS-polyacrylamide gel and assaying with streptavidin-phosphatase. A number of degradation products of ACCase were also shown by this sensitive procedure.

Attempts to determine the N-terminal amino acid sequence were unsuccessful, presumably because it was blocked. However, limited internal sequence data has been by cleaving the electrophoretically purified subunit with N-chlorosuccinimide, re-electrophoresing the products and electroblotting to PVDF membrane for N-terminal sequence analysis. The three sequences obtained were 8,4 and 8 amino acids in length and were 62.5%, 100% and 75% similar to sequences in chicken ACCase (Table 2).

Orange-A purified ACCase was injected into a rabbit and, following two booster injections, the rabbit serum was found to form immune complexes with maize ACCase activity. ACCase activity from Amaranthus was also immunoprecipitated by similar concentrations of antiserum.

ACCase antibodies were affinity purified by adsorption to and elution from a PVDF membrane containing electrophoretically purified ACCase subunit. This purified antibody preparation was used to screen the maize cDNA expression library.

Isolation of Maize ACCase cDNA Clones

Approximately 90,000 plaques of a maize cDNA expression library constructed in the vector λZAP (Strategene) were screened with affinity-purified ACCase antibody. Of the seven clones picked in the primary screening, three remained positive through two further rounds of purification. Two of these (A1 and A3) gave a much stronger signal than the third (A4).

Analysis of the Bluescript phagemids released from the λZAP clones indicated they were carrying DNA inserts of 4.0 (A1), 4.4 (A3) and 4.4 (A4) kb. Restriction mapping showed the 4.0 and 4.4 kb inserts of A1 and A3 to be almost identical, except for the extra 400 bp at the 5' end. The map of A4 was different but the A4 insert cross-hybridised to A1/A3.

Re-screening of the λZAP library with the A3 insert yielded a further six positively hybridising clones. Restriction mapping established that four of these were like A1/A3 and two were like A4. FIG. 2 shows the digestion pattern for an A4-like cDNA (A12) and an A3-like cDNA (A34) cut with a number of different restriction enzymes. These results indicate there are at least two genes for ACCase in maize.

The DNA sequence of the A3 ACC cDNA clone was determined (FIG. 3). The deduced amino acid sequence was 37% identical (58% similar) to the rat/chicken ACCase sequence and, over one stretch of 100 residues, the sequences were more than 60% identical.

The sequence of most of the A1 ACC cDNA was determined and found to be identical to the A3 sequence. Additional sequence information for the 5' ends of longer A3-like clones are given in FIGS. 4(a,b). In the case of clone A10/A49, the sequence includes the motif that encodes the conserved biotin-binding site, met-lys-met (FIG. 4b) of ACCase.

Analysis of the 5' ends of two A4-like clones, A12 and A4 ACC cDNAs (FIGS. 5a,b), and of the 3' end of A12 (FIG. 5c) showed there are a significant number of differences between the sequences of A3- and A4-like cDNAs. This supports the conclusion drawn from the restriction digest patterns that there must be at least two ACCase genes in maize.

Hybridisation of the A3 cDNA to Southern blots of EcoR1 and BamH1 digested maize DNA gave 3 and 2 bands respectively. It is likely therefore that there are no more than two ACCase genes in maize. The maize cDNA also hybridised to Arabidopsis DNA under low stringency conditions.

Hybridisation of the A3 cDNA insert to a northern blot of maize RNA gave just one band of size 8.0-8.5 kb.

Discussion

The size of the maize ACCase subunit (˜220 kDa) is similar to that of the enzyme from rat and chicken. The mRNA for such a protein would be expected to be at least 6.5 kb and according to our determination is in fact 8.0-8.5 kb for maize. This is considerably longer than the longest cDNAs (5.5 kb for A3-like and 5.7 kb for A4-like clones) described here. These cDNAs, however, will provide the means for isolating full-length cDNAs. The cDNAs may be used as primers for 5' extension against maize mRNA as a template or as probes for isolating the genes from a maize genomic library.

The maize ACCase cDNA cross-hybridised to Arabidopsis DNA and it therefore presents a means for isolating the gene from an Arabidopsis genomic library. The same heterologous probe may be used for isolating ACCase genes from a range of plants, including both monocots and dicots, for example oil-seed rape.

There are clearly two ACCase genes in maize. As only one band is observed when a northern blot of maize leaf RNA is hybridised with the A3 cDNA it would seem the messenger RNAs for both genes are the same size. More A3-like than A4-like clones were selected from the maize leaf cDNA library so the A3 mRNA must be the more abundant. Of the three cDNA clones initially selected from the maize expression library by their reaction with antibodies to maize ACCase, A1 and A3 gave the strongest signal with the antibody and A4 the weakest. This could indicate that the A3-like cDNAs encode the grass-weed herbicide-sensitive form of ACCase. Whether the enzyme encoded by the A4-like cDNA is herbicide-sensitive or tolerant is unclear.

By defining the 5' end of a full-length maize cDNA it will be possible to isolate the promoter sequence from a maize genomic clone. Similarly it will be possible to isolate the promoter for the Arabidopsis gene and by reciprocal constructions to express the herbicide-tolerant Arabidopsis enzyme in herbicide-sensitive monocot species (for example, maize, wheat, barley, rice). This provides a method for producing monocotyledonous crop plants tolerant to herbicides (such as fluazifop) that are active by interfering with the fatty acid biosynthesis pathway.

Finally, the invention enables the use of a full-length cDNA clone to synthesise active ACCase in a transformed micro-organism culture, such as an E. coli or yeast system. Such a system would be useful for selecting herbicide-resistant forms of the ACCase enzyme, and for testing new compounds for herbicidal activity. It would also allow investigation of the nature of the herbicide-binding site and of the basis of at least one of the mechanisms responsible for resistance to grass-weed herbicides.

Note: FIG. 2 shows restriction digests of the A4-like (ACC12) and the A3-like (ACC34) maize ACCase cDNA clones. The oddnumbered lanes show ACC12 and the even-numbered ones show ACC34. The enzymes used are Pst1 (1,2), BamH1+Bg1II (3,4), HindIII (5,6), HincII (7,8), EcoR1 (9,10), AccI (11,12), PvuII (13,14), PvuII+HincII (15,16), PvuII+EcoRV (17,18). Marker lanes (M) are λ DNA digested with HindIII.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 11                                                  (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 4345 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        GTTTTTTTTTTTTTATTTCATGGCAGTCTGACGTGATTGAAACATTGCGGCATCAGCACA60                 GTAAAGACCTGCAGAAGGTTGTAGACATTGTGTTGTCTCACCAGGGTGTGAGGAACAAAG120                CTAAGCTTGTAACGGCACTTATGGAAAAGCTGGTTTATCCAAATCCTGGTGGTTACAGGG180                ATCTGTTAGTTCGCTTTTCTTCCCTCAATCATAAAAGATATTATAAGTTGGCCCTTAAAG240                CAAGTGAACTTCTTGAACAAACCAAACTAAGTGAACTCCGTGCAAGCGTTGCAAGAAGCC300                TTTCGGATCTGGGGATGCATAAGGGAGAAATGAGTATTAAGGATAACATGGAAGATTTAG360                TCTCTGCCCCATTACCTGTTGAAGATGCTCTGATTTCTTTGTTTGATTACAGTGATCGAA420                CTGTTCAGCAGAAAGTGATTGAGACATACATATCACGATTGTACCAGCCTCATCTTGTAA480                AGGATAGCATCCAAATGAAATTCAAGGAATCTGGTGCTATTACTTTTTGGGAATTTTATG540                AAGGGCATGTTGATACTAGAAATGGACATGGGGCTATTATTGGTGGGAAGCGATGGGGTG600                CCATGGTCGTTCTCAAATCACTTGAATCTGCGTCAACAGCCATTGTGGCTGCATTAAAGG660                ATTCGGCACAGTTCAACAGCTCTGAGGGCAACATGATGCACATTGCATTATTGAGTGCTG720                AAAATGAAAGTAATATAAGTGGAATAAGTGATGATCAAGCTCAACATAAGATGGAAAAGC780                TTAGCAAGATACTGAAGGATACTAGCGTTGCAAGTGATCTCCAAGCTGCTGGTTTGAAGG840                TTATAAGTTGCATTGTTCAAAGAGATGAAGCTCGCATGCCAATGCGCCACACATTCCTCT900                GGTTGGATGACAAGAGTTGTTATGAAGAAGAGCAGATTCTCCGGCATGTGGAGCCTCCCC960                TCTCTACACTTCTTGAATTGGATAAGTTGAAGGTGAAAGGATACAATGAAATGAAGTATA1020               CTCCTTCGCGTGACCGCCAATGGCATATCTACACACTAAGAAATACTGAAAACCCCAAAA1080               TGTTGCATAGGGTGTTTTTCCGAACTATTGTCAGGCAACCCAATGCAGGCAACAAGTTTA1140               CATCGGCTCAGATCAGCGACGCTGAAGTAGGATGTCCCGAAGAATCTCTTTCATTTACAT1200               CAAATAGCATCTTAAGATCATTGATGACTGCTATTGAAGAATTAGAGCTTCATGCAATTA1260               GGACAGGTCATTCTCACATGTATTTGTGCATACTGAAAGAGCAAAAGCTTCTTGACCTCA1320               TTCCATTTTCAGGGAGTACAATTGTTGATGTTGGCCAAGATGAAGCTACCGCTTGTTCAC1380               TTTTAAAATCAATGGCTTTGAAGATACATGAGCTTGTTGGTGCAAGGATGCATCATCTGT1440               CTGTATGCCAGTGGGAGGTGAAACTCAAGTTGGACTGTGATGGCCCTGCAAGTGGTACCT1500               GGAGAGTTGTAACTACAAATGTTACTGGTCACACCTGCACCATTGATATATACCGAGAAG1560               TGGAGGAAATAGAATCGCAGAAGTTAGTGGTACCATTCAGCCACTTCGTCAGCTGGACCA1620               TTGCATGGTGTGCACTGAATAATCCATATCAACCTTTGAGTGTGATTGATCTAAAGCGCT1680               GCTCTGCTAGGAACAACAGAACAACATATTGCTATGATTTTCCGCTGGCCTTTGAAACTG1740               CACTGCAGAAGTCATGGCAGTCCAATGGCTCTACTGTTTCTGAAGGCAATGAAAATAGTA1800               AATCCTACGTGAAGGCAACTGAGCTAGTGTTTGCTGAAAAACATGGGTCCTGGGGCACTC1860               CTATAATTCCGATGGAACCGCTGCTGGGCTCAACGACATTGGTATGGTCGCTTGGATCAT1920               GGAGATGTCAACACCTGAATTTCCCAATGGCAGGCAGATTATTGTTGTAGCAAATGATAT1980               CACTTTCAGAGCTGGATCATTTGGCCCAAGGGAAGATGCATTTTTTGAAACTGTCACTAA2040               CCTGGCTTGCGAAAGGAAACTTCCTCTTATATACTTGGCAGCAAACTCTGGTTCTAGGAT2100               TGGCATAGCTGATGAAGTAAAATCTTGCTTCCGTGTTGGATGGTCTGACGAAGGCAGTCC2160               TGAACGAGGGTTTCAGTACATCTATCTGACTGAAGAAGACTATGCTCGCATTAGCTCTTC2220               TGTTATAGCACATAAGCTGGAGCTAGATAGTGGTGAAATTAGGTGGATTATTGACTCTGT2280               TGTGGGCAAGGAGGATGGGCTTGGTGTCGAGAACATACATGGAAGTGCTGCTATTGCCAG2340               TGCTTATTCTAGGGCATATGAGGAGACATTTACACTTACATTTGTGACTGGGCGGACTGT2400               AGGAATAGGAGCTTATCTTGCTCGACTTGGTATACGGTGCATACAGCGTCTTGACCAGCC2460               TATTATTTTAACAGGGTTTTCTGCCCTGAACAAGCTCCTTGGGCGGGAAGTGTACAGCTC2520               CCACATGCAGCTTGGTGGTCCTAAGATCATGGCGACCAATGGTGTTGTCCACCTCACTGT2580               TCCAGATGACCTTGAAGGTGTTTCCAATATATTGAGGTGGCTCAGCTATGTTCCTGCAAA2640               CATTGGTGGACCTCTTCCTATTACCAAACCTCTGGACCCTCCAGACAGACCTGTTGCTTA2700               CATCCCTGAGAACACATGCGATCCACGTGCAGCTATCTGTGGTGTAGATGACAGCCAAGG2760               GAAATGGTTGGGTGGTATGTTTGACAAAGACAGCTTTGTGGAGACATTTGAAGGATGGGC2820               AAAAACAGTGGTTACTGGCAGAGCAAAGCTTGGAGGAATTCCTGTGGGCGTCATAGCTGT2880               GGAGACACAGACCATGATGCAGATCATCCCTGCTGATCCAGGTCAGCTTGATTCCCATGA2940               GCGATCTGTCCCTCGTGCTGGACAAGTGTGGTTCCCAGATTCTGCAACCAAGACCGCTCA3000               GGCATTATTAGACTTCAACCGTGAAGGATTGCCTCTGTTCATCCTGGCTAATTGGAGAGG3060               CTTCTCTGGTGGACAAAGAGATCTCTTTGAAGGAATTCTTCAGGCTGGGTCAACAATTGT3120               CGAGAACCTTAGGACATATAATCAGCCTGCTTTTGTGTACATTCCTATGGCTGGAGAGCT3180               TCGTGGAGGAGCTTGGGTTGTGGTCGATAGCAAAATAAATCCAGACCGCATTGAGTGTTA3240               TGCTGAAAGGACTGCCAAAGGTAATGTTCTCGAACCTCAAGGGTTAATTGAAATCAAGTT3300               CAGGTCAGAGGAACTCCAAGACTGTATGGGTAGGCTTGACCCAGAGTTGATAAATCTGAA3360               AGCAAAACTCCAAGATGTAAATCATGGAAATGGAAGTCTACCAGACATAGAAGGGATTCG3420               GAAGAGTATAGAAGCACGTACGAAACAGTTGCTGCCTTTATATACCCAGATTGCAATACG3480               GTTTGCTGAATTGCATGATACTTCCCTAAGAATGGCAGCTAAAGGTGTGATTAAGAAAGT3540               TGTAGACTGGGAAGAATCACGCTCGTTCTTCTATAAAAGGCTACGGAGGAGGATCGCAGA3600               AGATGTTCTTGCAAAAGAAATAAGGCAGATAGTCGGTGATAAATTTACGCACCAATTAGC3660               AATGGAGCTCATCAAGGAATGGTACCTTGCTTCTCAGGCCACAACAGGAAGCACTGGATG3720               GGATGACGATGATGCTTTTGTTGCCTGGAAGGACAGTCCTGAAAACTACAAGGGGCATAT3780               CCAAAAGCTTAGGGCTCAAAAAGTGTCTCATTCGCTCTCTGATCTTGCTGACTCCAGTTC3840               AGATCTGCAAGCATTCTCGCAGGGTCTTTCTACGCTATTAGATAAGATGGATCCCTCTCA3900               GAGAGCGAAGTTTGTTCAGGAAGTCAAGAAGGTCCTTGATTGATGATACCAACACATCCA3960               ACACAATGTGTGCATGTCACATCTTTTTGTTCTAGTACATACATAGAAGGATATTGCTTG4020               GTCTTGATTGATCATGTCTGATTTAAGTCGACTATTATTTCTTGGAATTTTCTTTTGGAC4080               CTGGTGCTATGGTTGATGGATGTATATTGGATATGTGCGTTCTGCCAGGTGTAAGCACAA4140               AGGTTTAGACAGACCGAGAGCAAGAGCGAGTGAACCTGTTCTGGTTTTGCAGTGGTTCAG4200               TAAGGCAGAAAGTTGTTAAACCGTAGTTCTGAGATGTATTACCAGTGGCGCCATGCTGTA4260               CTTTTAGGGTGTATAATGCGGATACAAATAAACAATTTAGCGGTTCATTAAAGTTTGAAC4320               TCAAATAAAAAAAAAAAAAAAAAAA4345                                                  (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1313 amino acids                                                   (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        PhePhePhePheIleSerTrpGlnSerAspValIleGluThrLeuArg                               151015                                                                         HisGlnHisSerLysAspLeuGlnLysValValAspIleValLeuSer                               202530                                                                         HisGlnGlyValArgAsnLysAlaLysLeuValThrAlaLeuMetGlu                               354045                                                                         LysLeuValTyrProAsnProGlyGlyTyrArgAspLeuLeuValArg                               505560                                                                         PheSerSerLeuAsnHisLysArgTyrTyrLysLeuAlaLeuLysAla                               65707580                                                                       SerGluLeuLeuGluGlnThrLysLeuSerGluLeuArgAlaSerVal                               859095                                                                         AlaArgSerLeuSerAspLeuGlyMetHisLysGlyGluMetSerIle                               100105110                                                                      LysAspAsnMetGluAspLeuValSerAlaProLeuProValGluAsp                               115120125                                                                      AlaLeuIleSerLeuPheAspTyrSerAspArgThrValGlnGlnLys                               130135140                                                                      ValIleGluThrTyrIleSerArgLeuTyrGlnProHisLeuValLys                               145150155160                                                                   AspSerIleGlnMetLysPheLysGluSerGlyAlaIleThrPheTrp                               165170175                                                                      GluPheTyrGluGlyHisValAspThrArgAsnGlyHisGlyAlaIle                               180185190                                                                      IleGlyGlyLysArgTrpGlyAlaMetValValLeuLysSerLeuGlu                               195200205                                                                      SerAlaSerThrAlaIleValAlaAlaLeuLysAspSerAlaGlnPhe                               210215220                                                                      AsnSerSerGluGlyAsnMetMetHisIleAlaLeuLeuSerAlaGlu                               225230235240                                                                   AsnGluSerAsnIleSerGlyIleSerAspAspGlnAlaGlnHisLys                               245250255                                                                      MetGluLysLeuSerLysIleLeuLysAspThrSerValAlaSerAsp                               260265270                                                                      LeuGlnAlaAlaGlyLeuLysValIleSerCysIleValGlnArgAsp                               275280285                                                                      GluAlaArgMetProMetArgHisThrPheLeuTrpLeuAspAspLys                               290295300                                                                      SerCysTyrGluGluGluGlnIleLeuArgHisValGluProProLeu                               305310315320                                                                   SerThrLeuLeuGluLeuAspLysLeuLysValLysGlyTyrAsnGlu                               325330335                                                                      MetLysTyrThrProSerArgAspArgGlnTrpHisIleTyrThrLeu                               340345350                                                                      ArgAsnThrGluAsnProLysMetLeuHisArgValPhePheArgThr                               355360365                                                                      IleValArgGlnProAsnAlaGlyAsnLysPheThrSerAlaGlnIle                               370375380                                                                      SerAspAlaGluValGlyCysProGluGluSerLeuSerPheThrSer                               385390395400                                                                   AsnSerIleLeuArgSerLeuMetThrAlaIleGluGluLeuGluLeu                               405410415                                                                      HisAlaIleArgThrGlyHisSerHisMetTyrLeuCysIleLeuLys                               420425430                                                                      GluGlnLysLeuLeuAspLeuIleProPheSerGlySerThrIleVal                               435440445                                                                      AspValGlyGlnAspGluAlaThrAlaCysSerLeuLeuLysSerMet                               450455460                                                                      AlaLeuLysIleHisGluLeuValGlyAlaArgMetHisHisLeuSer                               465470475480                                                                   ValCysGlnTrpGluValLysLeuLysLeuAspCysAspGlyProAla                               485490495                                                                      SerGlyThrTrpArgValValThrThrAsnValThrGlyHisThrCys                               500505510                                                                      ThrIleAspIleTyrArgGluValGluGluIleGluSerGlnLysLeu                               515520525                                                                      ValValProPheSerHisPheValSerTrpThrIleAlaTrpCysAla                               530535540                                                                      LeuAsnAsnProTyrGlnProLeuSerValIleAspLeuLysArgCys                               545550555560                                                                   SerAlaArgAsnAsnArgThrThrTyrCysTyrAspPheProLeuAla                               565570575                                                                      PheGluThrAlaLeuGlnLysSerTrpGlnSerAsnGlySerThrVal                               580585590                                                                      SerGluGlyAsnGluAsnSerLysSerTyrValLysAlaThrGluLeu                               595600605                                                                      ValPheAlaGluLysHisGlySerTrpGlyThrProIleIleProMet                               610615620                                                                      XaaThrAlaAlaGlyLeuAsnAspIleGlyMetValAlaTrpIleMet                               625630635640                                                                   GluMetSerThrProGluPheProAsnGlyArgGlnIleIleValVal                               645650655                                                                      AlaAsnAspIleThrPheArgAlaGlySerPheGlyProArgGluAsp                               660665670                                                                      AlaPhePheGluThrValThrAsnLeuAlaCysGluArgLysLeuPro                               675680685                                                                      LeuIleTyrLeuAlaAlaAsnSerGlySerArgIleGlyIleAlaAsp                               690695700                                                                      GluValLysSerCysPheArgValGlyTrpSerAspGluGlySerPro                               705710715720                                                                   GluArgGlyPheGlnTyrIleTyrLeuThrGluGluAspTyrAlaArg                               725730735                                                                      IleSerSerSerValIleAlaHisLysLeuGluLeuAspSerGlyGlu                               740745750                                                                      IleArgTrpIleIleAspSerValValGlyLysGluAspGlyLeuGly                               755760765                                                                      ValGluAsnIleHisGlySerAlaAlaIleAlaSerAlaTyrSerArg                               770775780                                                                      AlaTyrGluGluThrPheThrLeuThrPheValThrGlyArgThrVal                               785790795800                                                                   GlyIleGlyAlaTyrLeuAlaArgLeuGlyIleArgCysIleGlnArg                               805810815                                                                      LeuAspGlnProIleIleLeuThrGlyPheSerAlaLeuAsnLysLeu                               820825830                                                                      LeuGlyArgGluValTyrSerSerHisMetGlnLeuGlyGlyProLys                               835840845                                                                      IleMetAlaThrAsnGlyValValHisLeuThrValProAspAspLeu                               850855860                                                                      GluGlyValSerAsnIleLeuArgTrpLeuSerTyrValProAlaAsn                               865870875880                                                                   IleGlyGlyProLeuProIleThrLysProLeuAspProProAspArg                               885890895                                                                      ProValAlaTyrIleProGluAsnThrCysAspProArgAlaAlaIle                               900905910                                                                      CysGlyValAspAspSerGlnGlyLysTrpLeuGlyGlyMetPheAsp                               915920925                                                                      LysAspSerPheValGluThrPheGluGlyTrpAlaLysThrValVal                               930935940                                                                      ThrGlyArgAlaLysLeuGlyGlyIleProValGlyValIleAlaVal                               945950955960                                                                   GluThrGlnThrMetMetGlnIleIleProAlaAspProGlyGlnLeu                               965970975                                                                      AspSerHisGluArgSerValProArgAlaGlyGlnValTrpPhePro                               980985990                                                                      AspSerAlaThrLysThrAlaGlnAlaLeuLeuAspPheAsnArgGlu                               99510001005                                                                    GlyLeuProLeuPheIleLeuAlaAsnTrpArgGlyPheSerGlyGly                               101010151020                                                                   GlnArgAspLeuPheGluGlyIleLeuGlnAlaGlySerThrIleVal                               1025103010351040                                                               GluAsnLeuArgThrTyrAsnGlnProAlaPheValTyrIleProMet                               104510501055                                                                   AlaGlyGluLeuArgGlyGlyAlaTrpValValValAspSerLysIle                               106010651070                                                                   AsnProAspArgIleGluCysTyrAlaGluArgThrAlaLysGlyAsn                               107510801085                                                                   ValLeuGluProGlnGlyLeuIleGluIleLysPheArgSerGluGlu                               109010951100                                                                   LeuGlnAspCysMetGlyArgLeuAspProGluLeuIleAsnLeuLys                               1105111011151120                                                               AlaLysLeuGlnAspValAsnHisGlyAsnGlySerLeuProAspIle                               112511301135                                                                   GluGlyIleArgLysSerIleGluAlaArgThrLysGlnLeuLeuPro                               114011451150                                                                   LeuTyrThrGlnIleAlaIleArgPheAlaGluLeuHisAspThrSer                               115511601165                                                                   LeuArgMetAlaAlaLysGlyValIleLysLysValValAspTrpGlu                               117011751180                                                                   GluSerArgSerPhePheTyrLysArgLeuArgArgArgIleAlaGlu                               1185119011951200                                                               AspValLeuAlaLysGluIleArgGlnIleValGlyAspLysPheThr                               120512101215                                                                   HisGlnLeuAlaMetGluLeuIleLysGluTrpTyrLeuAlaSerGln                               122012251230                                                                   AlaThrThrGlySerThrGlyTrpAspAspAspAspAlaPheValAla                               123512401245                                                                   TrpLysAspSerProGluAsnTyrLysGlyHisIleGlnLysLeuArg                               125012551260                                                                   AlaGlnLysValSerHisSerLeuSerAspLeuAlaAspSerSerSer                               1265127012751280                                                               AspLeuGlnAlaPheSerGlnGlyLeuSerThrLeuLeuAspLysMet                               128512901295                                                                   AspProSerGlnArgAlaLysPheValGlnGluValLysLysValLeu                               130013051310                                                                   Asp                                                                            (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 400 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CAAATACACAATTGAATCTGTAAGGACTGGACATGGTAGCTACAGGTTGAGAGTGAATGA60                 TTCAACAGTTGAAGCGAATGTACAATCTTTATGTGATGGTGGCCTCTTAATGCAGTTGGA120                TGGAAACAGCCATGTAATTTATGCAGAAGAAGAAGCTGGTGGTACACGGCTTCAGATTGA180                TGGAAAGACATGTTTATTGCAGAATGACCATGATCCATCAAAGTTATTAGCTGAGACACC240                CTGCAAACTTCTTCGTTTCTTGGTTGCTGATGGTGCTCATGTTGATGCGGATGTACCATA300                CGCGGAAGTTGAGGTTATGAAGATGTGCATGCCTCTCTTGTCACCTGCTTCTGGTGTCAT360                TCATTGTATGATGTCTGAGGGCCAGGCATTGCAGGCTGGT400                                    (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 133 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        LysTyrThrIleGluSerValArgThrGlyHisGlySerTyrArgLeu                               151015                                                                         ArgValAsnAspSerThrValGluAlaAsnValGlnSerLeuCysAsp                               202530                                                                         GlyGlyLeuLeuMetGlnLeuAspGlyAsnSerHisValIleTyrAla                               354045                                                                         GluGluGluAlaGlyGlyThrArgLeuGlnIleAspGlyLysThrCys                               505560                                                                         LeuLeuGlnAsnAspHisAspProSerLysLeuLeuAlaGluThrPro                               65707580                                                                       CysLysLeuLeuArgPheLeuValAlaAspGlyAlaHisValAspAla                               859095                                                                         AspValProTyrAlaGluValGluValMetLysMetCysMetProLeu                               100105110                                                                      LeuSerProAlaSerGlyValIleHisCysMetMetSerGluGlyGln                               115120125                                                                      AlaLeuGlnAlaGly                                                                130                                                                            (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 230 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        GTTGAATTTTTACCATGGAAAAAACGAGGACTTTCCATCCAAGTTGCTAAGAGACATCAT60                 TGAGGAAAATCTTTCTTATGGTTCAGAGAAGGAAAAGGCTACAAATGAGAGGCTTGTTGA120                GCCTCTTATGAACCTACTGAAGTCATATGAGGGTGGGAGAGAGAGCCATGCACATTTTGT180                TGTCAAGTCTCTTTTCGAGGAGTATCTTACAGTGGAAGAACTTTTTAGTG230                          (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 76 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        LeuAsnPheTyrHisGlyLysAsnGluAspPheProSerLysLeuLeu                               151015                                                                         ArgAspIleIleGluGluAsnLeuSerTyrGlySerGluLysGluLys                               202530                                                                         AlaThrAsnGluArgLeuValGluProLeuMetAsnLeuLeuLysSer                               354045                                                                         TyrGluGlyGlyArgGluSerHisAlaHisPheValValLysSerLeu                               505560                                                                         PheGluGluTyrLeuThrValGluGluLeuPheSer                                           657075                                                                         (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 250 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        GTTCAAGCTGAGAGGCCCCCATGGTATATCTCAGTGGTTGGAGGTGCTTTATATAAAACA60                 GTAACCACCAATGCAGCCACTGTTTCTGAATATGTTAGTTATCTCACCAAAGGCCAGATT120                CCACCAAAGCATATATCCCTTGTCAATTCAACAGTTAATTTGAATATAGAAGGGAGCAAA180                TACACAATTGAAACTGTCAGGACTGGGCATGGTAGGTACAAGTTGCGAATGAATGATTCA240                ACAGTTGAGG250                                                                  (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 83 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        ValGlnAlaGluArgProProTrpTyrIleSerValValGlyGlyAla                               151015                                                                         LeuTyrLysThrValThrThrAsnAlaAlaThrValSerGluTyrVal                               202530                                                                         SerTyrLeuThrLysGlyGlnIleProProLysHisIleSerLeuVal                               354045                                                                         AsnSerThrValAsnLeuAsnIleGluGlySerLysTyrThrIleGlu                               505560                                                                         ThrValArgThrGlyHisGlyArgTyrLysLeuArgMetAsnAspSer                               65707580                                                                       ThrValGlu                                                                      (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 330 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        TATGGTTCAGAGAAGGAAAAGGCTACAAATGAGCGACTTGTTGAGCCTCTTATGAACCTA60                 CTGAAGTCATATGAGGGTGGGAGAGAAAGCCATGCACATTTTGTTGTCAAGTCTCTTTTT120                GAGGAGTATCTTACCGTGGAAGAACTTTTCAGTGATGGCATTCAGTCTGACGTGATTGAA180                ACCTTGCGTCATCAGCACAGTAAAGACCTGCAGAAGGTTGTAGACATCGTGCTGTCTCAC240                CAGGGTGTGAGGAACAAAGCTAAGCTTGTAACAGCACTTATGGAAAAGCTGGTTTATCCA300                AATCCTGCATTGCAAGAAGCCTTTCAGATC330                                              (2) INFORMATION FOR SEQ ID NO:10:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 110 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                       TyrGlySerGluLysGluLysAlaThrAsnGluArgLeuValGluPro                               151015                                                                         LeuMetAsnLeuLeuLysSerTyrGluGlyGlyArgGluSerHisAla                               202530                                                                         HisPheValValLysSerLeuPheGluGluTyrLeuThrValGluGlu                               354045                                                                         LeuPheSerAspGlyIleGlnSerAspValIleGluThrLeuArgHis                               505560                                                                         GlnHisSerLysAspLeuGlnLysValValAspIleValLeuSerHis                               65707580                                                                       GlnGlyValArgAsnLysAlaLysLeuValThrAlaLeuMetGluLys                               859095                                                                         LeuValTyrProAsnProAlaLeuGlnGluAlaPheGlnIle                                     100105110                                                                      (2) INFORMATION FOR SEQ ID NO:11:                                              (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 319 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: both                                                         (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                       TGCTTGCCCCTTGATTGACCATGTCTGATCCTAAGTCGACCATTATTTCCTTGAAACTTC60                 CTTTCGGACGTGGTGCTATGGTTGATGAATTTGGATGTGTGCGTTCTGCCAGGTGTAAGC120                CCAAAGGTTTATACAGACCGAGTTAAGGTTAGGAAGAGCACGAGTGAACCTGTTCTGGTT180                TTGCAGTGGTTAAGGCAGAAAGTTGTTTCACTGTAGTTCTGAGATGTATTACCAGCGGCG240                CTGTAATTTTAGGGTGTATAATGCGGATGCTAGTAAACAATTGAGTGGTTCTTTAAAAAA300                AAAAAAAAAAAAAAAAAAA319                                                         __________________________________________________________________________ 

We claim:
 1. An isolated DNA sequence selected from the group consisting of that depicted in any one of FIGS. 3 (SEQ ID NO:1), 4(a) (SEQ ID NO:3); 4(b) (SEQ ID NO:5), 5(a) (SEQ ID NO:7), 5(b) (SEQ ID NO:9), and 5(c) (SEQ ID NO:11), wherein said sequence encodes a plant acetyl CoA carboxylase or portion thereof.
 2. The DNA sequence according to claim 1, flanked by heterologous DNA.
 3. The DNA sequence according to claim 2, wherein the heterologous DNA comprises a plant operable promoter.
 4. A clone selected from the group consisting of clones A3 and A4 which are deposited at the Australian Government Analytical Laboratory under the designations maize A3 ACC cDNA plasmid (Accession No. N91/074874) and maize A4 ACC cDNA plasmid (Accession No. N91/074875), respectively.
 5. A clone encoding a plant acetyl CoA carboxylase, said clone having been obtained by probing plant genomic or cDNA libraries, using as probe a DNA sequence of claim
 1. 6. A plant which has been transformed with the DNA sequence of claim 3, or the clone of claim
 5. 